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^^^^ ^^^^fc ne f ' rst com P lJter language was assem- 

¥ ^ bly language, which substitutes sym- 

bols chat almost look like English words 
for the t'i and zeros of the machine's 
"native" language. When you write in 
assembly language, you regularly by- 
pass the application software and com- 
municate directly with the CPU, BIOS, 
m JB^ and MS-DOS. By using assembly lan- 

guage, you begin to understand what is really happening inside 
the computer. 

Assembly language has other benefits, which are well known to 
professional programmers. It is fast— up to 100 times faster than 
Basic. It uses memory more efficiently than high-level languages, 
and it !s usually the best language for controlling the computer's 
I/O devices, especially the video screen. The main drawback— 
an often prohibitively long programming 
time-is manageable if you use assembly 
language sparingly. In fact, the best soft- 
ware combines high-level code with assem- 
bly language subroutines. 

Normally, to do any serious assembly 
language programming you have to buy a 
commercial assembler, which translates 
commands into the binary code understood 
by the computer. Conveniently, the De- 
bug.COM program that comes with MS- 
DOS provides a window into machine-level 
and assembly language programming. In 
this tutorial, I'll show you how to go into 
Debug and see how the computer stores 
data in numerical form. Then I'll offer some 
short assembly language programs that il- 

asscmblers. 

Debug Starts Here 

To begin, put your MS-DOS disk in drive A and type DEBUG. 
In a second or two, you should see a hyphen on the next line. 
The hyphen is Debug's prompt, and it means that Debug is waiting 
for your command. Debug expects you to press the enter key 
after each command, which you can type in either upper- or 
lowercase. 

Type in the letter r and press enter. You will see a three-line 
display. The chart Breakdown of Initial Debug Display explains 
the significance of the entries. Some of the numbers and letters 
might be different, but the organization of the display will be 
essentially the same. (In all of the figures accompanying this article, 
the only characters you type in arc those next to the Debug prompt 
and memory addresses [such as 16BA:0138], Everything else is 
commentary or a representation of what you should see on screen.) 

By typing in an r, you have asked Debug to show you the 
current contents of the CPU's registers. Registers are memory 
locations in the CPU (instead of in the computer's RAM). You can 
use some registers for almost any purpose; others are limited to 
specific uses. (If you don't understand how memory addresses 
work, see the sidebar "The Structure of Memory") 

The CPU in MS-DOS computers has 14 internal registers, each 
of which can hold 16 bits, or one word. The first four— AX, BX, 
CX, and DX— are called general-purpose registers; their use is 
largely up to you. Each of these four registers can also serve as 
two 8-bit (the equivalent of two 1-byte) registers. The top byte 
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WHAT VOU NEED: MS-DOS and Bask. 
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You don't need to be a pro to write programs in assembly language. 



Using the MS-DOS Debug program as your starting point, 

. 

you can master the basics in record time. 



of AX is called AH; the low byte is called AL. You determine 
when to use these areas as 16-bit registers and when to use them 
as 8-bit registers. 

In the initial Debug display, the contents of each general-purpose 
register is 0000. The same value is stored in three pointer registers: 
BP, SI, and Dl. The base pointer (BP) register generally serves 
as a placeniark to help you manipulate a complex data structure 
called a stack frame. 

The source index (SI) and destination index (DI) registers are 
mostly for moving large blocks of memory, which are called strings 
in assembly parlance regardless of whether they contain textual data. 

The final register in the first row of the Debug display is the 
stack pointer (SI'), an in-memory data structure that serves many 
purposes. Whenever a program branches to a subroutine, it stores 
the return address on the stack. Programmers also use the stack 
to temporarily store the contents of registers and sometimes to 
pass values from one routine to another. 

The first four registers in the second row of the display arc the 
segment registers. Because of the way memory is organized in an 
MS-DOS computer, two registers are required to designate mem- 
iry locations. The segment registers point to a large chunk of 
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memory; the specific address within that chunk is kept in one of 
the other registers. 

The fifth register in the second row is the instruction pointer 
(IP). It always contains the address of the next instruction to be 
executed, in the same way that Basic always keeps track of which 
line number should be interpreted and executed next. 

The contents of all 13 numeric registers are displayed in hexa- 
decimal (hex) format. Assembly language programmers rarely use 
decimal numbers. Instead, they use cither the binary (base 2) or 
hex (base 16) number system. 

The last part of the second line in the initial Debug display 
contains eight two-letter abbreviations showing the state of the 1§> 
important bits in the flag register (see the tahle Status Flags). The j| 
CPU has a special 16-bit register to keep track of these status g 
flags. They are updated to give information about the results of ^ 
many assembly language operations and can be tested to determine .g, 
whether a program should branch to a new set of instructions, § 
The flags are the basis of all conditional tests in assembly language. | 
For example, the zero flag might change from NZ to ZR to show _| 
that the result of a mathematical operation is zero. Not all flag- 
register bits arc used. 
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Status Flags 



AH flags off: NV UP DI PL NZ NA PO NC 
All flags on: OV DN EI NG ZR AC PE CY 



Overflow: 

Direction: 

Interrupts: 

Sign: 

Zero: 

Auxiliary carry: 
Parity: 



NV = no overflow 
UP = increment 
DI = disabled 
PL = plus 
NZ = not zero 
NA = no auxiliary 
carry 

PO = odd parky 
carry 



OV = overflow 
DN = decrement 
El = enabled 
NG = negative 
ZR = zero 
AG = auxiliary 
carry 

PE = parity even 
CY = carry 



Video Output 

So far, you've done a lot of looking but 
no programming. Before writing a short 
program, you need to know one more De- 
bug command. If you type in an r. Debug 
shows you the initial register display. But 
if you type in an r plus the name of a 
register, Debug displays the contents of the 
register and lets you enter a new value. 

For example, if you type: 



Debug displays AX 0000 on one line and 
a colon on the next. If you enter 11 1 i , 
Debug will change the contents of the AX 
register to 1 1 1 1 hex. Remember that num- 
bers typed into and displayed by Debug 
are in hex format. 

Now type in the letter a to invoke the 
Assemble command. Debug answers by 
displaying a four-digit number, a colon, 
and 0100. The cursor appears without a 
prompt and waits for you to enter program 
statements. 

The numbers displayed are the segment 
and offset of the current location in mem- 
ory. The segment addresses you see will 
probably differ from those in the figures; 
they are determined by the MS-DOS ver- 
sion you use and whether you have any 
memory-resident utilities, RAM disks, or 
print spoolers. The number after the colon, 
which is called the offset, should be 0100, 
All .COM programs begin at address 100 
hex of their memory segment, and Debug 
can create only .COM programs. If the 
offset you see is not 0100, press enter and 
type: 

alfJO 

to start assembly at location 0100 hex. 

The first program is exceedingly simple. 
Type in the three instruction lines in Figure 
1 . Use the tab key to space between col- 
umns and press enter at the end of each 
line. Press the enter key once more, and 
you should be back at Debug's hyphen 
prompt. To check your work, enter: 



ulOO I 7 
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which tells Debug to "unassemble" 7 bytes 
starting at address 100 hex, (The letter / 
stands for length.) Your screen should look 
like the one shown at the bottom of Figure 
1. Type in an r to produce the register 
display. It should look the same as your 
original register display, except that the last 

gram. 

A Basic Vocabulary 

Every line in a Debug assembly language 
program has two parts. The first part of 
the line always contains a two-, three-, or 
four-letter command. These commands arc 
called mnemonics, or memory words, be- 
cause they represent exactly one CPU in- 
struction and are easier to remember than 
groups of binary digits. They are sometimes 
called op-codes because they represent 
CPU operations. 

After the op-code are one or two oper- 
ands {or none). The number depends on 
the particular command and the types of 
information on which it operates. You may 
conclude the line with a semicolon followed 
by a remark. 

The first op-code in the program is MOV, 
a mnemonic for the Move command. It is 
one of the most common mnemonics in 
any assembly language program. You use it 
to move data into registers, into memory, 
from register to register, and between reg- 
isters and memory locations. Its name is 
technically incorrect, since it only copies 
information from one place to another. Like 
Basic's Let statement, it leaves the value in 
the source operand intact. 

The MOV operator is always followed 
by two operands: the destination and the 
source. In the Figure 1 program, the first 
line tells the CPU, "Move the value 1 into 
the AX register," In a similar manner, the 
second line loads the value Z into the BX 
register. The two lines are analogous to the 
Basic statement: 

AX= 1: BX = 2 

It might be obvious to you that the third 
line is an addition instruction; what is not 



so obvious is where the information origi- 
nates and where the result is stored. Avoid 
the temptation to read the line as, "Add 
AX and BX." Instead, read it as, "Add the 
value in the BX register to that in the AX 
register." When you think of it that way, 
the expected result should be clear: The 
value in BX will be added to the value in 
AX, and the result will be left in the AX 
register. This line is analogous to the Basic 
statement: 

AX=AX+BX 

One advantage of programming in Debug 
is that you can watch the program execute 
step by step. Type in the letter / (for the 
Trace command) three times and watch 
the AX and BX registers. The Trace com- 
mand tells Debug to execute an instruction 
and display the registers again. It lets you 
watch each register being loaded with 
the appropriate value and the final result 
being placed in the AX register. Your 
screen should resemble the one shown in 
Figure 2. 

Saving and Running a Program 

Although the first program doesn't ac- 
complish much, it helps you learn the ru- 
diments of Debug's register display, assem- 
bling and unassembling a program, and 
tracing a program to watch it execute. Be- 
fore you can use Debug to create a program, 
you must be able to save the program to 
disk, so that you can run it from the MS- 
DOS prompt. 

The program in Figure 3, while simple, 
introduces several new concepts. Before 
starting, you should clear the last program 
from Debug's memory. Type in the letter 
q to quit Debug, then at the MS-DOS 
prompt type DEBUG. 

Shuffling data around in registers is a 
useful skill that produces no output to watch 
or evaluate. The Figure 3 listing, however, 
prints a message on screen— in this case, 
the phrase, "Hello world!" The program 
adds two new skills to your repertoire: sav- 
ing a program on disk and communicating 
with MS-DOS. 

The CPU knows nothing about video 
screens, disk drives, keyboards, modems, 
or printers. To perform I/O functions, a 
program must either manipulate the com- 
puter's hardware directly, which is a com- 
plicated and difficult task, or seek help from 
MS-DOS or the BIOS, 

To write programs of your own, you need 
a list of MS-DOS routines and an expla- 
nation of how to use them. You can find 
this information in the programmer's ref- 
erence manual for any MS-DOS computer. 
The MS-DOS routines are the same re- 
gardless of which computer you own; the 
BIOS routines are essentially the same for 
all IBM PC/XT/AT compatibles. 

The first line of the program places tb 
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Breakdown of Initial Debug Display 

■ Concents of Stack 




NV UP EI PL NZ NA PO NC 



■ 

Contents of ^J / 
segment registers 



A A ■■■ A 

J 



169G0100 0000 



T 



Segment, offset, 
and hex code for 
next instruction 



CurreJstatcof' ^.^ 
Instruction pointer — I ■ flag bits ■ 

V address of next instruction 

to be performed - .'■*. •• . ,\ ■ 



ADD [BX + SI],AL 



Mnemonic form of 



next instruction 




value 09 hex in the AH register (the top 
byte of the AX register). Later, the program 
asks for an MS-DOS service to print a line 
on the screen. In general, you request each 
MS-DOS service by putting its number in 
AH, loading the other registers with the 
information that MS-DOS will need, and 
then making an MS-DOS request. In this 
case, you use MS-DOS service 9 (display 
string). 

The second line loads the DX register 
with the value 1 20 hex. To display a string, 
MS-DOS must know where to find the 
string in memory, and MS-DOS service 9 
expects you to put the string's address in 
the DS and DX registers. Since the DS 
register is already set to represent the mem- 
ory area that the program will use, you need 
to set only the DX register. The value 1 20 
hex will be past the last instruction in the 
program and is a convenient location to 
store the string. 

The third line of the program invokes 
MS-DOS and sends it your request. The 
INT mnemonic stands for interrupt — a term 
that refers to the CPU's ability to be in- 
terrupted by events in the real world. Every 
time you press a key, the keyboard hard- 
ware interrupts the CPU, which stops what 
it is doing, gets the code for the key you 
pressed, and stores the code in its type- 
ahead buffer. The CPU is also interrupted 
PC Resource November 1987 



Figure /, Type in the three program lines near the center of the figure. The simple program 
demonstrates how to use MOV and ADD, two common assembly language commands. 

The program as you type it. 



Axle bug 
-al00 



169C 
169C 
169C 
169C 



0100 
0103 
0106 
0108 



add 



ax,l 
bx,2 
ax.bx 



The program as Debug unassembles it. 
-ul00 17 

169C:0100 B80100 MOV AX, 0031 

169C:0103 BB0200 MOV BX.0002 

169C:0106 BIDS ADD AX.BX 



End 



Figure 2. Debugs Trace 
registers. 



lets you see how each line in Figure 1 changes the CPU's 



CX-0000 
SS-169C 



cx=«0 

SS-169C 



AX-0000 BX-0000 
DS=169C ES-169C 
16%: 01 00 BS0100 



AX-000I BX-0000 
DS=169C ES-169C 
169C:0103 BB0200 
-t 



™ gZfgSc ss-W 

169C:0106 0108 
-t 

AX-0003 BX-0002 
DS-169C ES-169C 
]69t:0108 0000 
DS; 



BX-0000 SP-FFEE 
CS-169C IP-0100 
I AX, 0001 



MOV 



CS-169C 



SP-FFEE 
IP-0103 



BX.0002 



BP-0000 S 1=0000 01=0000 
NV UP EI PL NZ NA PO NC 



BP-0000 SI-0000 DI-0000 
NV UP EI PL NZ NA PO NC 



DX-0000 SP-FFEE 
C5-169C 1P-0106 
ADD AX , BX 



CX-0000 DX=0000 SP=FFEE 
SS=169C CS-169C IP-0108 
ADD [BX+SI],AL 



B nvTei 



NA D P0 



BP-0000 SI-0000 DI-0000 
NV UP EI PL NZ NA PE NC 



End 
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. he best way 
to learn, though, 
is to study 
and experiment 
with short programs. 



return and line feed characters with the 
string. The carriage return is ASCII char- 
acter 13 {OD hex); the tine feed is ASCII 
character 10 (OA hex). The Enter command 
tells Debug to enter the text of the message, 
followed by a OD hex byte, a OA hex byte, 
and the dollar sign that MS-DOS requires 
to terminate strings. 

The first section in Figure 5 is repeated 
three times, each time with a different loop- 
ing mechanism. The command: 

a 130 

tells Debug to start assembling at address 
130 hex, which is the target of the JMP 
command at the beginning of the program. 

[n the first example, the DX register is 
loaded with the string address, the CX 
register is loaded with the loop count, and 
the AH register is loaded with the MS- 
DOS service number. The program calls 
INT 21, as before, to request that MS- 
DOS print the string. The next line of the 
program, loop 138, uses a special looping 



Video by Numbers 

You can manipulate the video screen by 
changing the bits within the byte that 
controls how each character is displayed. 
This byte is called the screen attribute 
byte, and it immediately precedes the 
byte that contains the character code in 
memory. Both are stored in a special sec- 
tion of RAM that is set aside for video. 

The program in Figure 6 prompts you 
to enter a screen attribute byte in hexa- 
decimal. If you have a color monitor, 
consult the table below to find the 3-bit 
equivalents for the colors you want for 
the foreground and background. Insert 
the color values in the proper places in 
die byte, which are shown in the chart 
Screen Attributes. For the blink and high- 
intensity features, write 1 to turn a feature 
on and zero to turn it off. Then convert 
the entire byte to hexadecimal. 

With monochrome-only graphics adapt- 

Screen Attributes 



ers, color value 000 is black and 1 1 1 is 
white. Most of the other values represent 
shades of gray, tf the background color 
value is 100, displayed characters will be 
underlined. 

With color adapters, the following color 
values are possible: 

Color 

Black 
Blue 
Green 
Cyan 
Red 

Magenta 
Brown 
Light gray 

In this scheme, intensified black ap- 
pears as dark gray, intensified brown 
is yellow, and intensified light gray is 
white. Q 




Blink 
bit 



Background 
color 



High 
intensity 



Foreground 
color 
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mechanism built into the CPU. The Loop 
instruction really tells the CPU, "Reduce 
the value in CX by t. If CX is not zero, 
jump to the address indicated in this in- 
struction. If CX is zero, go on to the 
next instruction." In other words, Debugs 
Loop command is much like Basic's Next 
statement. 

The second version of the program uses 
a different technique to control the loop. 
This time, the loop counter is loaded into 
the BX register (you could use CX, SI, DI, 
or BP the same way). At the end of the 
loop, the instruction: 

dec bx 

tells the CPU to decrement (reduce by 1 ) the 
value in the BX register. The loop should 
continue until BX is zero, at which point the 
program will set the zero status flag. 
The next instruction: 

jnz 138 

tells the CPU to jump to location 138 hex 
only if the zero flag is not set. Because the 
zero flag is set only if BX has been dec- 
remented to zero, the loop runs 20 times. 
These two lines are analogous to the Basic 
statements: 

BX = BX - 1 : IF BX <> THEN GOTO 138 

One advantage of this kind of loop is that 
you can nest it in another loop controlled 
by the CX register. 

The final version of the program uses 
other instructions to control the loop. It 
loads the loop counter into the CX register 
again, but this time it decrements CX at 
the end of the loop. Then the instruction: 

jcxz 13f 

tells the CPU, "Jump to address 13F hex 
if CX contains zero. Otherwise continue to 
the next instruction." The following line 
has an unconditional jump back to the 
top of the loop. The CPU sees the JMP 
instruction only if CX has a value other 
than zero. 

The three lines controlling the end of the 
loop in the third example are similar to the 
Basic statements: 

CX = CX -I: [FCX = 0TI-[GN GOTO 131- 
ELSE GOTO 138 

The JCXZ instruction (jump if CX is zero) 
is often used in complex looping structures 
to make 3 program check for the end of a 
loop in the middle of a block of code. 

After you have assembled and saved all 
three versions of the program, try running 
them from the MS-DOS prompt to make 
sure they all work. If they don't, trace 
through them (remember to use the Go 
command when you come to an INT in- 
struction) to see where you made a mistake. 

You might be disappointed at the speed 
of the three programs. Assembly language 
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is supposed to be fast, yet these programs 
seem to run no faster than Basic's Print 
command. The culprit is the MS-DOS dis- 
play-string service, which displays one char- 
acter at a time and checks the keyboard to 
be sure you aren't pressing Ctrl-Break or 
Gtrl-C to stop the program. Most com- 
mercial programs avoid the MS-DOS dis- 
play services because they are so slow. 

Something Useful 

The program listing in Figure 6 is much 
longer than the other programs, but you 
should be able to understand it without 
much difficulty. It uses some new MS-DOS 
services and one of the BIOS services. 

The program prompts you for two hex 
digits and interprets them as screen attri- 
butes. It then clears the screen, sets the 
new attribute, and returns to MS-DOS. 
The new screen attributes stay in effect 
until another program changes them. Con- 
sult the sidebar "Video by Numbers" to 
learn how to manipulate bits in the screen- 
attribute byte and translate the result 
into hex. 

The first step in writing a program like 
this with Debug is organizing the program 
and deciding on addresses for various parts 
of it. Since you need to know the addresses 
before typing in the program, you must 
guess how long each section will be. Inev- 
itably, there will be wasted space— parts of 
the program won't be used at all— but the 
entire screen attribute program will take up 
only 193 bytes on disk, which is less than 
the minimum 1,024 bytes that MS-DOS 
allocates to each disk file. Thus, you don't 
have to worry about conserving bytes. 

My outline for the program appears at 
the top of Figure 6. The program prints a 
prompt, waits for the user to enter two hex 
digits and a carriage return, clears the 
screen, sets the requested attributes, and 
ends. I've broken the Debug assembly pro- 
cess into logical blocks and added a com- 
ment to eacli line so you can see what is 
happening each step of the way. 

The program begins at address 100 hex 
by using MS-DOS service 9 to display the 
prompt string. It then calls MS-DOS ser- 
vice 1 (read keyboard and echo) at address 
107 hex to accept a keystroke. The user 
sees the typed character on screen but 
cannot backspace or edit characters. 

Service 1 returns the keystroke in the 
AL register. Next, at address 10B hex, the 
program uses the Call instruction to send 
control to a subroutine. Debug's Call is 
much like Basic's Gosub instruction; when 
you invoke it, the current address is saved 
on the stack and the program jumps to the 
new routine. When the routine ends, it uses 
an RET instruction, which is analogous to 
Basic's Return, to go back to the main 
program. 

The subroutine, which I'll explain in a 
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Figure 6. This Debug scrip! creates an assembly language program that uses MS-DOS and 
BIOS services to change the display attributes. 

Program Outline: 

100: Print prompt 
Get keystroke 

Call convert routine 
If carry flag ts set, start again, 
else store key value in BH 
Get keystroke 

Call conuert routine 
If carry flag is set, start again, 
else add key to value in BH 
Get keystroke 

Compare to carriage return 
[f different, start again 
Use Video BIOS routine to clear screen 
End program. 

150: Convert routine -- Keystroke in AL converted to binary: 
If keystroke ts less than '0' then go to "Invalid" 
If keystroke is less than or equal to '9' then go to "Set value" 
Force keystroke to uppercase 

If keystroke is less than 'A' then go to "Invalid" 
If keystroke is greather than 'F' then go to "Invalid" 
Add 9 to key value 

170: Set valuet 

Erase top four bits of key value 

Reset carry flag 

Return 



180: 



Inval id: 
Set carry flag 
Return 



190: Prompt message 



A>debug 
-a 103 
]69C:0100 
169C:0103 
169C:0105 



169C 
169C 
169C 
169C 
169C 
169C 
169C 



0107 
0109 
01 0B 
01 0t 
0110 
0112 
0114 



169C:0116 
169C:0118 
169C:011B 
169C:011D 

169C:011F 

169C:0125 
1690:0127 
169C:012A 
169C:012D 
169C:012F 



169C 
169C 
169C 
159C 



0131 
0134 
0136 
0133 



169C:013A 
169C:013C 

-a 150 



169C 
169C 
169C 
I69C 
169C 
169C 
169C 
169C 
169C 
169C 
169C 
169C 



0150 
0152 
0154 
0156 

sm 

015A 
015C 
015E 
HI 60 
0162 
0164 
0166 



-a 170 

169C:0170 

169C:0172 



mov 
mov 
int 

mov 

int 

call 

jc 

mov 

shl 

mov 

int 
call 
ic 
add 

int 
cmp 
jne 

mov 
mov 
mov 
mov 
int 



mov 
mov 
int 

int 



cmp 

jb 

cmp 

jbe 

and 

cmp 

jb 

cmp 

add 



and 
clc 



dx,19£ 
ah, 9 
21 

ah,l 

21 

150 

100 

Cl,4 

al ,cl 

bh.al 

21 
150 
100 
bh.al 

21 

al,0d 



al,0 
cx,0 
dx,184f 
ah, 6 
10 

dx,0 
bh,0 
ah, 2 
10 

20 



al ,30 

180 

al ,39 

170 

al.df 

a 1 , 41 

180 

al ,46 

180 

al,9 

170 



al ,0f 



;DX - string address 

;D0S service: Display String 

[Call DOS 

;D0S service: get keystroke 
;Call DOS 

;Convert keystroke 
;If error, start over 
;Amount to shift 
;Hove to top of byte 
;flnd save value 

[Get another keystroke 
;Convert it 
;lf error, start over 
;Else add to 1st value 

;Get another keystroke 
Carriage return? 
;No -- start over 

;Else scroll entire window 
;0,0 is top-left corner 
;18h,4fh - 24,79 -- bottom corner 
;BI0S service: scroll window up 
;Call Video BIOS 

;0,0 is top -left corner 
;Select page 

;B10S service: set cursor position 
;Call Video BIOS 

;Return to DOS 



;Is key <. '0' ? 

iVes mark as invalid 

;ls key < '9' ? 

;Yes -- set value 

;Else force to upper-case 

; Is key < 'A' ? 

;Yes -- mark as inval id 

;Is key > 'F' ? 

;Yes -- nark as inval id 

;Else add offset 

; and set value 



;Throw away top 4 bits 
-.Clear error flag 



Figure 6 continued I 
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~Pr Figure 6 continued 

169C:£)173 ret ' ; and return 

169C:0174 

-a 130 

I69C:0180 stc ;Set error indicator 

1690:0181 ret ; and return 

1696:0132 

-e 190 0d 0a "Enter two hex digits for screen attribute —i I" 
-d 190 1 40 

169C:0190 0D 0A 45 6E 74 65 72 20-74 77 6F 20 68 65 73 20 ..Enter two hex 
169C:01A0 64 69 67 69 74 ?3 20 66-6F 72 20 73 63 72 65 65 digits for scree 
169C:01B0 6E 20 61 74 74 72 69 62-75 74 65 20 3D 3D 3E 20 n attribute 
169C:01C0 24 00 00 00 00 00 00 00-00 00 00 00 00 00 00 00 S 

CX C 0000 

:el 

-n screen.com 
Urtting 00C1 bytes 

-q 

End -+ 



moment, is set up to either convert the 
keystroke from ASCII to hex or to set the 
carry flag in the status register if the key- 
stroke isn't in the appropriate range. Be- 
cause the carry flag is easy to manipulate 
and test, programmers often use the carry 
flag to pass succcss-or-failure messages be- 
tween routines. The line after the Call uses 
a JC instruction (jump only if the carry flag 
is set). 

Generally, programmers test status flags 
by having the program jump to a special 
section of code depending on the condition 
of one or more of the flag bits. These 
conditional jumps resemble Basic's If . . . 
Then Goto statement. There are 30 kinds 
of conditional jumps in assembly language, 
although many are synonyms for each 
other. If you are tracing through a program 
and Debug seen is to have changed the 
condition for jumping, don't be alarmed. 

if the carry flag was not set, the ASCII- 
to-hex conversion subroutine was success- 
ful. In this case, the AL register contains 
a value between 00 hex and OF hex to 
indicate which key was pressed. Since this 
is the first of two hex digits that the program 
expects the user to entfcr, what you really 
need is a value between 00 hex and 0F0 
hex; that is, the program has to shift the 
value from the lower half of the byte to the 
upper half. 

There are two ways to perform this shift. 
The program can multiply the byte in AL 
by 10 hex of it can shift every bit in AL 
four positions to the left. The second 
method is faster and easier to program than 
the first. 

The instruction at address 1 10 hex moves 
a count of 4 into the CL register. The 
instruction: 

shl al,cl 

in the next line tells the CPU to shift the 
value in AL a number of positions left 
equivalent to the value in CL. During each 
step of the shift operation, the current value 
in the carry flag is discarded, the bit farthest 
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to the left of the operand (AL, in this case) 
is put in the carry flag, she other bits in 
the operand are each moved over one place 
to the left, and a zero bit is inserted into 
the position farthest to the right of the 
operand. The process sounds more com- 
plicated than it is. Each shift to the left is 
identical to multiplying the operand by 2, 
so the four left shifts in the program are 
equivalent to multiplying by 2 to the 4th 
power, or 16. 

After the value in AL has been shifted 
to the top of the byte, the result is copied 
to the BH register. There is a reason for 
selecting this particular register, which I'll 
explain later. 

Nothing you have done so far has altered 
the original value of 1 in the All register, 
so the program can ask for another key- 
stroke simply by invoking INT 21 again. 
Once again, it calls the conversion subrou- 
tine to handle the keystroke and checks 
the carry flag for an error. 

If the conversion subroutine doesn't re- 
port an error, the new value in AL {which 
is between 00 hex and OF hex) is added to 
the number that was stored in BH. Then 
a final call to MS-DOS service 1 (at address 
1 1 F hex) gets a third keystroke, which 
should be a carriage return. The program 
then uses the CMP instruction at address 
121 hex to compare the keystroke in AL 
with the value for a carriage return. The 
following line uses another conditional 
jump, JNE (jump if not equal), to restart 
the program if the user did not press enter. 

Once the program gets to address 125 
hex, the user has typed two hex digits 
followed by a carriage return, and the pro- 
gram has converted the digits to binary form 
and stored them in the BH register. It is 
now time to clear the screen and set the 
new attributes. 

There are no MS-DOS services to clear 
the video screen, set attributes, or position 
the cursor. If you use MS-DOS's 
ANSI. SYS console driver, you can print 
special sequences of characters that do such 



things, but they tend to be slow, and they 
won't work if you haven't installed the 
ANSI. SYS file. On all IBM PC/XT/AT- 
compatible computers, INT 10 calls a set 
of BIOS ROM routines that control the 
video screen, change modes, position the 
cursor, print graphics, and so on. 

Service 6 of INT 10 scrolls any portion 
of the active screen up a specific number 
of lines (service 7, which scrolls down, 
would serve the purpose just as well). You 
call it by placing the service number (6) in 
the AH register, placing the number of lines 
to scroll in AL, setting values for the top- 
left and bottom-right comers of the scroll 
window in CX and DX, and putting the 
attribute to be used in BH. The program 
has already stored the attribute in BH, so 
you can ignore that step for now. 

At address 125 hex, the program places 
a zero in AL to signify that the entire 
window should be erased. At 127 hex, mov- 
ing a zero into the CX register is the same 
as loading a zero into CH and CL to tell 
the BIOS that the top edge of the window 
is in row zero, column zero. (Screen rows 
and columns are always counted from zero, 
not 1.) The value placed in DH and DL 
should be 18 hex {24 decimal) and 4F hex 
(79 decimal), respectively, to indicate that 
the bottom edge of the window is in column 
79 of row 24. Instead of using two separate 
instructions to place those values, the pro- 

mov dx,184F 

Finally, the program places the service 
number in AH and calls INT 10. 

The video BIOS routine responds by eras- 
ing the entire screen, using space characters 
and the new attribute byte in BH. However, 
it does not move the cursor to the top of the 
screen; you have not yet completed the as- 
sembly language equivalent of the Basic 
clear-screen (CLS) command. 

Video BIOS service 2 positions the cur- 
sor. To call it, you must make the program 
place the requested cursor position in DH 
and DL, the video page number in BH, 
and the service number (2) in AH. Since 
you want the cursor to be in the top left of 
the screen, the program loads the DX reg- 
ister with zero. Unless another program has 
impolitely left the video area confused, the 
current video page will be zero, which the 
Figure 6 program places in BH. Then it 
calls INT 10 again to put the cursor at the 
top of the screen. The last step is to return 
to MS-DOS through INT 20. 

Converting Numbers 

Most of the remainder of the program in 
Figure 6 is concerned with converting a 
keystroke to a hex digit. If the user presses 
5, for example, the AL register will contain 
35 hex, which is the ASCII value of 5. You 
need a subroutine to change 35 hex to 05 
PC Resource November 1987 



hex and to make sure the user typed a valid 
hex digit. 

The subroutine that does this begins at 
address 150 hex with a series of tests and 
conditional jumps. If the value in AL is (ess 
than 30 hex, you don't have a valid hex 
digit. If it is between 30 and 39 hex inclu- 
sive, it is a decimal digit and can be con- 
verted directly to binary. The JB (jump if 
below) instruction at address 152 hex 
means "Jump if the left operand was less 
than the right operand in the last test." The 
JBE operand two lines later means, "Jump 
if it was below or equal." 

IF the keystroke was not between zero 
and 9, it may be one of the alphabetic hex 
digits. It may also be in either upper- or 
lowercase. A look at any chart that converts 
ASCII characters to binary reveals only one 
difference between upper- and lowercase 
letters: Bit 5 is turned on in lowercase 
letters and turned off in uppercase letters. 
The instruction: 

and al,df 

at address 158 hex uses the logical And 
operation to check that bit 5 is turned off. 
The same technique works in Basic, where 
it is often expressed in a line like: 

CH$ = CHRS<ASC(CHS) AND &HDF) 

Next, the Figure 6 program performs two 
more tests to see if the keystroke in AL is, 
indeed, be ween the letters A and F. If it 
is, the program adds 9 to the character 
before jumping to address 170 hex. Adding 
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the 9 is another bit of trickery (pun in- 
tended). The ASCII values for the letters 
A-F are 41-45 hex. By adding 9, you con- 
vert them to 4A-4F hex. The second digit 
of the resulting value is now correct. 

If the character is valid, the AL register 
now holds a value either between 30-39 hex 
or 4A-4F hex. Since you want a result be- 
tween 00-0F hex, you need only change the 
first half of the byte to zero by using another 
And operation at address 170 hex. Then the 
CLC command clears (turns off) the carry 
flag and an RET instruction returns control 
to the main part of the program. 

If the user does not type a valid character, 
the subroutine passes control to address 
180 hex. There, it turns on the carry flag 
with the STC (set carry flag) instruction 
before the program returns. 

All that is left is to place the necessary 
prompt in memory with Debug's Enter com- 
mand, display a section of memory to see 
how long the total program is, and save the 
program. (The prompt begins with a carriage 
return and line feed so it is always displayed 
on a new line, even if the user makes a mis- 
take and the program starts over.) 

After you save the program, you will 
undoubtedly want to return to MS-DOS 
and run it. If it doesn't work correctly, re- 
enter Debug, load the program, and trace 
through it. (Remembet not to trace through 
rhe interrupt calls.) Debugging and tracing 
are a necessary part of writing in assembly 
language, because almost every program 
has bugs in it at first. 



What Next? 

If you have enjoyed this short introduc- 
tion to assembly language, you will want 
to experiment with your own ideas, write 
more complex programs, and learn to use 
the full CPU instruction set. To do this, 
you will need documentation of both the 
MS-DOS services and the BIOS interrupts, 
as well as the full CPU instruction set, 
There are many good books on assembly 
language that have both. 

Vou will probably become frustrated with 
Debug.COM's limitations and want a bet- 
ter assembler and tracing utility. The stan- 
dard assembler, and the one used in most 
magazine articles, is Microsoft's Macro 
Assembler (MASM). Newer versions of 
MASM have a debugging program called 
Symdeb, which is a large step up from 
Debug.COM. 

The best way to learn, though, is to study 
and experiment with short programs. You 
might find that a well-commented assembly 
language program is at least as easy to 
understand as a program of similar length 
written in a high-level language. You will 
gain insight into your computer that will 
make you a better programmer— no matter 
which high-level language you choose. □ 

HARDIN BROTHERS is a freelance preg-ammer and 
technical writer. Write to him at 280 N. Campus Aoe, , 
Upland, CA 917%. Enclose a self-addressed, stamped 
envelope for a reply. You can also contact Hardin through 
Compuserve's Eosyplex at 70007,1150. 
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digit. - grams. 
BIOS—Basic input/output system. A set K— Kilobyte, or 1,024 bytes. K is an ab- 
of subroutines, generally in ROM, pro-^ bre via tion of kilo, which means 1,000; the 
vided by the computer manufacturer for actual number equals 2 to the 10th power, 
communication with the computer's hard- math coprocessor— A special CPU de- 
ware. DOS uses the BIOS extensively, signed to perform floating-point calcula- 
and some BIOS routines, especially those tions at top speed. IBM PC/XT 
related to the video display, are useful to compatibles often use the Intel 8087 
programmers, as well. CPU, while AT-class computers often use 
byte— Eight bits grouped asalogical unit, the 80287. 

Data is usually stored in bytes, and ma- MB— Megabyte, or 1,048 ,5 76 bytes. Mega 
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address— The location in memory 80186, and AT-type machines use the memory segments are divided into para- 
(ROM or RAM) of a byte or word. In 802.86, The CPU is sometimes also called graphs, 

MS-DOS computers, each address can ; tadmain processing unit (MP U). RAM— Random-access memory (a mis- ' 

be represented as a unique 20-bit value. D©S-©isk operating system. A series nomer). RAM is memory that can be 
There are 1,048, 5 76 + ( 1MB) addresses, of routines that allow programmers to changed by the CPU, Generally, the term 
bit— The smallest unit of data. It is rep-, ccwifriCnicate with the disks, keyboard, refers to the memory that is allocated to 
resented in the computer as a + 5 or zero screen, and other parts of the computer's MS-DOS and programs. MS-DOS com- 
voltage, and it is often written as 1 or hardware. The DOS is also responsible puters can have a maximum of 64 OK or 
zero. The word is a contraction of binary for loading, running, and ending pro- 704K of RAM, depending on the system 

•configuration. 

register—A memory , location in the 
CPU. In the 8088 family, each register 
can hold one word, or 16 bits, of data. 
The four general-purpose registers (AX, 
BC, CX, and DX) can also be addressed 
as 8-byte registers. 

ROM— Read-only memory. ROM is 
memory that has permanent program 
code "burned" into it at the factory. The 
CPU can execute the programs in ROM, ■ 
but it cannot alter ; the program code. 
ROM contains the computer's boot-up . 
code, BIOS routines, and, on IBM com- 
puters, part of Basic, 
segment— The most significant 16 bits 
in a 20-bit memory address. 



chine-language instructions are 1 or more 

bytes long. 

CPU— The central processing unit, or 

main chip, of a personal computer. Most 
JM PC/XT compatibles use the Intel 
088 or 8086 CPU. The Tandy 2000 and ! 



means 1,000,000; the number equals 2 
to the 20th power. 

nibble— Half a byte, or 4 bits, that are 
thought of as a logical unit, 
offset— The least significant 16 bits in a 
20-bit memory address. 
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